Linear prediction incorporating simultaneous masking
نویسندگان
چکیده
Whilst linear prediction is the cornerstone of most modern speech coders, few of these coders incorporate the perceptual characteristics of hearing into the calculation of the linear predictor coefficients (LPCs). This paper proposes a method of incorporating simultaneous masking into the calculation of the LPCs. This modification requires only a modest increase in computational complexity and results in the linear predictor removing more perceptually important information from the input speech signal. This results in a filter that better models the formants of the input speech spectrum. The net effect is that an improvement in quality is achieved for a given bit rate or alternately a bit rate reduction can be achieved while maintaining perceived quality. These results have been confirmed through subjective listening tests. Disciplines Physical Sciences and Mathematics Publication Details This paper originally appeared as: Lukasiak, J, Burnett, IS, Chicharo, JF & Thomson, MM, Linear prediction incorporating simultaneous masking, ICASSP '00. Proceedings. 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing, 5-9 June 2000, vol 3, 1471-1474. Copyright IEEE 2000. This conference paper is available at Research Online: http://ro.uow.edu.au/infopapers/218 LINEAR PREDICTION INCORPORATING SIMULTANEOUS MASKING J. Lukasiak, IS. Burnett, J . F. Chicharo, M.M. Thomson * Whisper Laboratories, TITR University of Wollongong Wollongong, NSW, Australia, 2522 *Motorola Australian Research Centre, Botany, NSW, Australia, 201 9
منابع مشابه
Low rate speech coding incorporating simultaneously masked spectrally weighted linear prediction
Linear prediction (LP) is the cornerstone of most modern speech compression algorithms. Previously it has been shown that incorporating a weighting function based on the simultaneous masking property of the ear into the calculation of the LP coefficients (SMWLPC) allows the filter to better model the unmasked sections of the input spectrum. This paper conducts a detailed analysis of the impleme...
متن کاملPerceptual wavelet packet audio coder
Traditional wavelet packet audio compression algorithms do not utilize the temporal masking properties of the human auditory system, relying instead on simultaneous masking models. This paper presents the design and implementation of a perceptual wavelet audio coder by incorporating temporal and simultaneous masking models. The efficiency of the encoder was assessed based upon the number of bit...
متن کاملMasking by inaudible sounds and the linearity of temporal summation.
Many natural sounds, including speech and animal vocalizations, involve rapid sequences that vary in spectrum and amplitude. Each sound within a sequence has the potential to affect the audibility of subsequent sounds in a process known as forward masking. Little is known about the neural mechanisms underlying forward masking, particularly in more realistic situations in which multiple sounds f...
متن کاملSingle channel speech enhancement by frequency domain constrained optimization and temporal masking
A speech enhancement algorithm is proposed that exploits the masking properties of the human auditory system. The enhancement is formulated as a frequency domain constrained optimization problem. The noise components of the noisy speech are suppressed by a gain function subject to the constraint that both the signal distortion and residual noise should fall below the masking thresholds. Tempora...
متن کاملLinear and Nonlinear Processes in Temporal Masking
A number of masking phenomena can be modeled in terms of a linear auditory filter bank followed by a temporal integrator and a simple decision device based on the signal-to-masker ratio. Other aspects require the inclusion of a nonlinearity following linear filtering. The present article concentrates on aspects of non-simultaneous, or “temporal”, masking that cannot be explained by either model...
متن کامل